Adaptive spectral analysis for speech-sound recognition
نویسندگان
چکیده
منابع مشابه
Signal adaptive spectral envelope estimation for robust speech recognition
This paper describes a novel spectral envelope estimation technique which adapts to the characteristics of the observed signal. This is possible via the introduction of a second bilinear transformation into warped minimum variance distortionless response (MVDR) spectral envelope estimation. As opposed to the first bilinear transformation, however, which is applied in the time domain, the second...
متن کاملAnalysis of Speech Recognition Techniques for use in a Non-Speech Sound Recognition System
This paper discusses the use of speech recognition techniques in non-speech sound recognition. It analyses the different techniques used for speech recognition and identifies those that can be used for non-speech sound recognition. It then performs benchmarks on two of these techniques (LVQ and ANN’s) and determines which technique is better suited for non-speech sound recognition. As a compari...
متن کاملComparison of spectral analysis methods for automatic speech recognition
In this paper, we evaluate the front-end of Automatic Speech Recognition (ASR) systems, with respect to different types of spectral processing methods that are extensively used. Experimentally, we show that direct use of FFT spectral values is just as effective as using either Mel or Gammatone filter banks, as an intermediate processing stage, if the cosine basis vectors used for dimensionality...
متن کاملOn frequency averaging for spectral analysis in speech recognition
Many speech recognition systems use logarithmic filter-bank energies or a linear transformation of them to represent the speech signal. Usually, each of those energies is routinely computed as a weighted average of the periodogram samples that lie in the corresponding frequency band. In this work, we attempt to gain an insight into the statistical properties of the frequency-averaged periodogra...
متن کاملIntrinsic Spectral Analysis for Zero and High Resource Speech Recognition
The constraints of the speech production apparatus imply that our vocalizations are approximately restricted to a lowdimensional manifold embedded in a high-dimensional space. Manifold learning algorithms provide a means to recover the approximate embedding from untranscribed data and enable use of the manifold’s intrinsic distance metric to characterize acoustic similarity for downstream autom...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Audio and Electroacoustics
سال: 1968
ISSN: 0018-9278
DOI: 10.1109/tau.1968.1162015